Random walks on complex networks with inhomogeneous impact 
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In many complex systems, for the activity fi of the constituents or nodes i a power-law relationship 
was discovered between the standard deviation ai and the average strength of the activity: ai oc 
{fi}"'; universal values a = 1/2 or 1 were found, however, with exceptions. With the help of an 
impact variable we introduce a random walk model where the activity is the product of the number 
of visitors at a node and their impact. If the impact depends strongly on the node connectivity and 
the properties of the carrying network are broadly distributed (like in a scale free network) we find 
both analytically and numerically non-universal a values. The exponent always crosses over to the 
universal value of 1 if the external drive dominates. 
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Complex systems usually consist of many interacting 
units such that their scaffold is a network. The most 
puzzling questions are usually the ones regarding the dy- 
namics of such systems. Examples range from traffic (ve- 
hicular or Internet) through the biochemistry of the cell 
to markets which are systems generated by human in- 
teractions. Due to its generality, the network aspect has 
turned out to be extremely fruitful in these studies, but 
the dynamics is usually rather individual, system depen- 
dent. However, recently an interesting unifying feature 
was found in systems with multichannel observations. 

Processes taking place in many complex systems can 
be characterized by generalized activities fi{t) > 0, de- 
fined at each constituent or node i = 1 . . . N. For a wide 
range of systems, research Q, Q |1| has revealed power- 
law scaling between the mean and the standard deviation 
of the activity of the nodes: 



where by definition 



(1) 



(2) 



This is not unmotivated from equilibrium statistical 
physics. Many physical systems belong to the class a = 
1/2; in most cases this is the fingerprint of equilibrium 
and the dominance of internal dynamics. Examples of 
such behavior are a computer chip or the hardware level 
Internet (the network of data transmission) 

One can show analytically the existence of a universal- 
ity class with a = 1. This value always prevails in the 
presence of a strong driving force, when the dominant 
factor is such externally imposed dynamics. This limit is 
found for river networks, highway traffic and the World 
Wide Web ^. 
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It is instructive to recall that although only different 
aspects of the same system, Internet and WWW fall into 
separate categories. Former has a robust internal activity 
even without outside (human) interaction, due to auto- 
matic queries, data transfer, etc. On the other hand, 
latter consists of web pages whose activity is generated 
by external demand (i.e., the clicks of users). 

For such an analysis multichannel monitoring of a large 
number of elements is needed with a possibly broad range 
of (fi). A previous work |2| introduced a method of de- 
composition to separate the effects of an external driv- 
ing force from the system's internal dynamics originating 
from the constituents' individual behavior and interac- 
tions. Furthermore, when external driving was absent or 
subordinate, a = 1/2 seemed to hold for every investi- 
gated system, though universality could not be proven. 
In fact, more recent studies have shown that there are 
exceptions from this rule: Fluctuations of stock market 
trading activity (traded volume times price) are char- 
acterized by q: w 0.72 0. In this paper we present a 
generalization of a random walk model by Menezes and 
Barabasi , which accounts for such anomalous a values 
and clarifies their microscopic origins. 

The model in Ref. was motivated by the statistics of 
web page visitations and we will also use this language. 
Let us take a scale free Barabasi-Albert network of N 
nodes (web pages) First, we distribute W tokens 

(users, walkers) on the nodes randomly. In every time 
step every walker jumps from its present site to a random 
neighbor of the site along an edge (link) . The new feature 
is that if a user steps to a site, it has to pay a certain 
value V{i) (exerts a certain impact) which depends on 
the degree fc(i) of the visited node (more popular pages 
with more links cost more money) as a power-law: 



V{i) oc fc(i)^. 



(3) 



We must emphasize, that although this is a simplifica- 
tion, the assumption is not ad hoc. E.g., the average 
activity for a given stock was found to scale with the ex- 
ponent ^ sa 0.44 as a function of the capitalization of the 
companies 0, which, in a sense, can be identified with 
the nodes' strength. This application will be discussed in 
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detail. 

We continue the diffusion for T steps and finally cal- 
culate the total profit of every web site, which equals the 
N{i) number of visitations multiplied by V{i); these val- 
ues will be the activities fi{t = 1) of the nodes on day 
1. We iterate this procedure for the same network for 
t = 1, . . . ,D days to generate the whole time series fi{t). 
The original variant presented in Ref. corresponds to 
the special case ^ = 0: it only counts N{i) for each node. 
Later we will address the case when the V{i) values are 
allowed to fluctuate. 

We expect a power-law scaling relationship such as 



(4) 



i.e., the mean number of the visitations to a node is pro- 
portional to its degree. Higher connected vertices should 
have higher traffic. Stationary solution of the master 
equation yields i> ^ 1 analytically, in line with Ref. 0. 
Also, if initially tokens are distributed uniformly, after 
the first step the expectation value of the number of to- 
kens at any node will be directly proportional to its de- 
gree. Therefore the model reaches the stationary state in 
one step. For the sake of generality we will keep the no- 
tation for a general value of ly, which could be generated 
by different dynamics. 

We can write the mean profit of node i as 



(5) 



Fluctuations can be expressed by the application of the 
central limit theorem. As walkers do not interact, their 
visits are independent, and thus for large enough (N{i)) 
and finite (^N{i)'^') the distribution converges to a Gaus- 
sian. Moreover, the variance of the visits at node i is 
= {{N{i) ~ (iV(i)))2) (X {N{i)) (X kii)". The vari- 
ance of the signal detected on node i can then be written 
as 



a%,Vii)^ cx kit) 



(6) 



where the proportionality comes from (jS)). Finally, one 
can combine ||SJ) and © to get 



1 



l-i/i^ + 1 



(7) 



a is the internal dynamical exponent defined by JQ) in 
the absence of external forces. 

We performed simulations of such a process and found 
perfect agreement with the above calculation. We fixed 
a Barabasi- Albert network of iV = 2000 nodes, W = 200 
tokens, T = 100 steps per day, and averaged over D = 
10^ days. We also varied /i = —0.5, . . . , 5.0. Examples 
for the scaling relation ^ arc shown in Fig. 

mU There 

is a clear dependence of the slope on the value of fi. The 
measured exponents a, compared with the analytical for- 
mula are shown in Fig. [21 

The r.h.s. of lO is governed by the single parameter 
n/u. By setting /i = 0, we recover the original, non- 
independent impacts and a = 1/2. If fi/v > 0, the 
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Figure 1: Scaling of the standard deviation of node activity 
witli the mean signal of the same node. A single point in the 
graph represents the average standard deviation of all nodes 
with approximately the same flux. Slopes on the log-log scale 
give the internal dynamical exponent a. Varying the impact 
distribution by changing jj, causes a continuous change in a, 
as expected from Q. 
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Figure 2; The values of the internal dynamical exponent a as 
a function of fi governing the distribution of node dependent 
impacts. Circles show simulation results (A'^ = 2000 nodes, 
W = 200 walkers, T = 100 steps/day, averaged for D = 10^ 
days). The solid line represents the analytical formula Q for 
v — 1. By setting ^ = 0, we recover a — 1/2, which is ob- 
served for several equilibrium systems. If one allows for node 
dependent impacts {fi 0), non- universal behavior emerges 
and a can change continuously between and 1. 



scaling exponent of fluctuations increases, a > 1/2. As 
ji/h' ^ oo, a —> 1, which is the same exponent but due 
to a different mechanism as a = 1 arising from strong 
driving. Note that by choosing fi/i/ < 0, a < 1/2 values 
are also accessible. 

This result is robust against fluctuations in V, i.e., if 
different users spend different amounts of money while 
visiting the same web page, provided 



V{i) = {V{i))-X (X k{iY -X. 



(8) 



X is drawn independently from a fixed distribution for 
every visitation of node i. It is also assumed to have a 
finite second moment. 
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The distributions of N{i) and V{i) are independent 
and so they factorize in jSJl, which formula hence remains 
unchanged. In order to prove that scahng suggested by 
© also persists, let us write 



N 



N 



\n=l 



\n=l 




(9) 



where n runs for all the N visits to site i during the day 
and Vi{ri) is the profit from the nth visit. By denoting 

'}2,n=i^ii''^) density function as P(V/v), and 

that of N{i) by P(A^), it is possible to rewrite (0) as 



= P(7V) / dVNnVN)V^ 

N=0 

(f^PiN) J dVNP{VN)VN] . (10) 



\N=0 



By both adding and 



the term 



subtracting 

EN=o^iN) {j;^ dVNnVN)VN)\ then applying the 
equality dVivP(VW)VAr = N {V{i)) and that for any 
fixed N the variance of Vn is ay^ 
finds 



Nay, one finally 



- 4, {N{i)) « k{i) 



(11) 



The final proportionality comes from similar arguments 
as in the case of In particular, we know that 

cr^j oc {N{i)) (X k{iY . On the other hand, with re- 
spect to scaling with the node degree k{i), we defined 
V{if (X k{if'', while also cr^ oc k{if''. 

One can see explicitly the new source that contributes 
to fluctuations. The first term, basically the same as 
before, comes from diffusive dynamics. The second, ad- 
ditional term is which describes the effect of visit to visit 
variations present in impacts V{i). Regardless of this 
more complicated structure, the scaling of af with the 
vertex degree k{i) is preserved, similarly to {fi). Thus the 
dynamical exponent a is unaffected. Simulations based 
on various distributions of X confirmed this calculation. 

Next, in order to analyze the behavior of the system 
under the influence of an external drive, we allowed day to 
day changes in the number of walkers W . Following Ref. 
[H, we introduced W{t) = {W) + Aiy(t), where ^W{t) 
is Gaussian white noise with standard deviation AW Q ■ 
This acts as an external driving force and contributes to 
fluctuations. It is known, that increasing ISW toward 
the strongly driven limit (AW^/ {W) ^ 1), any system 
displays a crossover to a = 1 as a sign of the growing 
dominance of exogenous behavior. We used the above 
set of parameters and varied AW/ (W) = 5 • 10~^, . . . , 15. 
For all values of the internal exponent Q we recovered 
this expected tendency, as shown in Fig. Note that the 
intermediate values above that given by Q but below 1 
are effective exponents, actual scaling breaks down due 
to the crossover between them. 



1.0 



0.8 



< 0.6 

8 



0.4 
0.2 




10' 



lO"' 10° 
AW/<W> 



10^ 



Figure 3: Measured values of q as a function of the relative 
strength of driving force (AW/ (W)) for several fixed values 
of (iV = 2000 nodes, W = 200 walkers, T = 100 steps/day, 
averaged for D = 10^ days). Although the internal value 
{AW/ {W) <^ 1) varies, all systems display the universal be- 
havior a = 1 in the exogenous limit {AW/ (W) S> 1). 



This approach can be reversed. Driving hides the mi- 
croscopic dynamics, because all systems display the uni- 
versal value a = 1. However, if it is possible to measure 
the internal exponent a{AW/ {W) <C 1), one can decide 
about the presence of impact inhomogeneity. It has been 
found 0, that for the hardware level Internet a = 1/2. 
In this case fi{t) is the data flow through node i at time 
t, {V{i)) is the mean size of the passing data packets and 
{N{i)) is their mean number per unit time. This shows, 
that across the nodes of this system (routers) only the 
number of packets varies, but their size does not (/i = 0). 
This homogeneous dynamics can be expected: As the 
same packets pass many computers, the mean of their 
sizes can well be independent of node degree. 

Although not readily represented as a network, a sim- 
ilar analysis can be carried out on stock market data 
[lOj . Here, fi{t) is the value of stocks of the company 
i bought/sold at time t, {N{i)) is the mean number of 
transactions per unit time, and V{i) is the value of stocks 
exchanged in a single trade. The role of degree (node size) 
is taken by company capitalization. In this case, it has 
been found that i/ « 0.39 and /i « 0.44 This is direct 
evidence for the existence of scaling proposed in (PJ . Ac- 
cordingly, a has the non-trivial value 0.72 due to the 
presence of inhomogeneous impacts. 

If the dynamics that generates the activities fi{t) is 
much slower than the method used to record them, one 
can observe the single events at each node. This happens, 
e.g., if we track each walker in our model. In this case, 
given the node size distribution k{i), it is straightforward 
to measure fi and v directly from fi{t) by their deflni- 
tions. Again, this has been possible for stock markets 
0, because there all individual trades are documented 
as so called tick-by-tick data. Other possibilities could 
be distributed computing or telephone networks, where 
events take a longer time, while logs of activity can be 
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written instantly. It would be interesting to check the 
validity of our assumptions in these networks too. 
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activity nodes, whereas only a few highly active nodes, it 
is straightforward to take equal size bins in log (fi) and 
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weight is given to all regimes in {fi). This method was 
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Of course the number of walkers W{t) is always non- 
negative. Thus when (W) + AW{t) < 0, we define W{t) 
= 0. 
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